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Abstract 

We consider queueing systems with n parallel queues under a Join the Shortest Queue (JSQ) 
policy in the Halfin-Whitt heavy traffic regime. We use the martingale method to prove that 
a scaled process counting the number of idle servers and queues of length exactly 2 weakly 
converges to a two-dimensional reflected Ornstein-Uhlenbeck process, while processes counting 
longer queues converge to a deterministic system decaying to zero in constant time. This limiting 
system is comparable to that of the traditional Halfin-Whitt model, but there are key differences 
in the queueing behavior of the JSQ model. In particular, only a vanishing fraction of customers 
will have to wait, but those who do will incur a constant order waiting time. 


1 Introduction. 

In this paper we consider queueing systems with many parallel servers under a heavy-traffic 
regime where the workload scales with the number of servers. Such systems are well understood 
when a global queue is maintained (i.e. M/M/n and similar queues }l'0l). but in many practical 
situations it may be advantageous to instead maintain parallel queues. Even if a global queue 
is itself not problematic, it may be necessary to keep queued customers close to the server who 
will eventually serve them. Consider for example an airport setting with arriving passengers 
who need to have their passports checked with one of a large number of passport controllers. In 
this situation having only a global queue can lead to significant walk times between the front 
of the queue and the server, leaving servers idle while they wait for their next customer. This 
idle time can be avoided by routing customers to individual queues for each server before earlier 
customers finish service. 

At the same time, a parallel scheme will necessarily allow servers to idle if their own queue 
is empty, even if customers are waiting in another queue, thus sacrificing some efficiency. To 
analyze this tradeoff, we will study a parallel queueing system in which each arriving customer 
is immediately routed to the queue containing the smallest number of customers, namely the 
Join the Shortest Queue (JSQ) policy. We consider the system in (Halfin-Whitt) heavy traffic 
by allowing the arrival rate X n to depend on n, letting the quantity (1 — A n )y/n have a non¬ 
degenerate limit, which we denote /3 > 0. Note that JSQ is a logical first step for understanding 
the tradeoffs involved in maintaining parallel queues, because Winston |20| proved that among 
policies immediately assigning customers to one of n < oo parallel queues, JSQ is optimal in the 
case of Poisson arrivals and exponential service times. That is, it maximizes, with respect to 
stochastic order, the number of customers served in a given time interval. Weber )18] extended 
this result to the more general class of service times with non-decreasing hazard rate, with no 
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assumptions on the arrival process. We will consider Poisson arrivals and exponential service 
times, and denote this system M/M/n-JSQ, distinguishing it from the traditional M/M/n 
system which maintains a global queue. 

Our main result describes the behavior of processes counting the number of idle servers 
and of queues with one customer waiting to enter service, along with auxiliary processes to 
count the number of longer queues. We prove that a system that initially has a fixed maximum 
queue length, appropriately scaled, converges weakly to a diffusion process as n approaches 
infinity. The coordinates of this diffusion process representing queues with more than 1 customer 
in service are deterministic and show that any longer queues present in the initial condition 
disappear in fixed time and do not form again. The coordinates corresponding to the number 
of idle servers and number of queues with exactly one customer waiting correspond to a two- 
dimensional reflected Ornstein-Uhlenbeck process. The entire limiting system will be defined in 
terms of a stochastic integral equation which we prove has a unique solution. This existence and 
uniqueness result is stated in Theorem [l] and the weak convergence result is stated in Theorem 
El which is our main result. 

As a proof technique, we will introduce a truncated variant of the M/M/n -JSQ system in 
which no queues of length longer than 2 are created, though such long queues are allowed in the 
initial condition. This system is more easily analyzed because it is finite dimensional and has 
limited interaction between many of the dimensions. In this truncated system, we show that 
the probability the system hits the truncation barrier decreases to zero as n approaches infinity, 
and thus in the limit the behavior of the truncated and untruncated systems are the same. 

One consequence of our result is that both the number of idle servers and the number of 
queues with exactly one customer waiting in the M/M/n- JSQ system are of the order O (yfn). 

Another feature of interest in this queueing system is the waiting time experienced by arriving 
customers. We prove that in the transient system the aggregate waiting time experienced by 
all customers is of the order O {y/n), and since the number of customers arriving in that time is 
order n, the waiting time per customer is O (l/i Jn). We also observe that any arriving customer 
who has to wait will incur a waiting time which is exponentially distributed with parameter 1, 
which is the service time of the customer in service when they enter a queue. Since any waiting 
customers must incur a constant order waiting time and the aggregate waiting time is O {y/n ), 
the fraction of customers who end up waiting is of the order O (1 /y/n). 

Next we review prior literature. The JSQ model was initially studied in the special case 
of 2 queues by Haight [5]. Kingman m proved stability results along with considering the 
stationary distribution of the system, and Flatto and McKean [5] also examine the stationary 
distribution. Further work on the n = 2 case includes bounds on the distribution of the number 
of people in the system by Halfin [5]. 

Foschini and Salz [5] consider diffusion limits for the heavy traffic case of the M/M/ 2-JSQ 
system, first proving that the queue-length processes for the two queues are identical in the 
limit and then deriving the limiting distribution. The limiting behavior of the waiting time 
is the same as the standard M/M/2 system in heavy traffic. Their results extend to the case 
of k parallel queues, but they do not consider the case where the number of queues grows as 
the traffic intensity increases. Zhang and Wang in and Zhang and Hsu J2T] look at a similar 
problem but drop the assumption of Poisson arrivals and exponential service times, deriving 
functional central limit theorems for the heavy traffic JSQ system with s servers. 

Thus our paper is the first study of JSQ systems in the asymptotic regime as n —> oo. 
Observe that for fixed A < 1 as n increases the probability of any customer arriving to find all 
servers busy will decrease to zero. In this case the JSQ nature of the system becomes irrelevant 
as customers will be assigned to an idle server immediately upon arrival. In particular we see 
that the limiting behavior of the system will essentially be that of the M/M /oo system, and 
thus it is of interest to consider this model in heavy traffic with A approaching unity. There has 
been some work on models similar to ours, most notably Tezcan |16| . who considers a variant 
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of the JSQ system with multiple pools of servers who each have their own queue. He uses a 
state-space collapse argument based on a framework of Dai and Tezcan [2] to prove diffusion 
limits under the Halfin-Whitt heavy traffic regime. In this case that regime has the number of 
servers and traffic intensity increasing together in the limit, but the number of pools of servers is 
fixed so the number of queues is also fixed. Therefore our model is similar to Tezcan’s but is not 
a special case of it. The state-space collapse argument implies that in the limit the system can 
be fully described by the total number of people in the system (rather than the queue lengths in 
the individual pools) and the diffusion limit of that process is very similar to the original Halfin 
and Whitt result [TO] , 

Another branch of analysis of JSQ-like queueing systems has focused on the “supermarket 
model” in which arriving customers join the shortest queue from among d randomly selected 
queues rather than from the entire system. It was proved independently by Mitzenmacher m 
and Vvedenskaya, Dobrushin, and Karpelevich E3 that this system achieves an exponential im¬ 
provement in expected waiting time over a system with n independent M/M/ 1 queues. Versions 
of this system where d depends on n are particularly closely related to our JSQ model, which 
essentially sets d = n. Brightwell and Luczak jT| give a set of d and A values depending on n 
for which they prove the steady-state system is usually in a particular state with most queues 
having the same (known) length. Their conditions require (1 — A) -1 > d, which excludes the 
d = n, (1 — A )\/n —> fd case considered in this paper. Dieker and Suk [4] prove fluid and diffusion 
limits for queue length processes when when d increases to infinity at a rate slower than n and 
with fixed A < 1. 

The remainder of the paper is laid out as follows: Section [5] defines the model and states our 
main result. In Section [3] we will prove Theorem [T] verifying that the integral representation 
of the limiting system is well defined. This result will also be the key to proving convergence 
via a continuous mapping theorem (CMT) argument. Section [I] will construct a representation 
of the system as a combination of martingales and reflecting processes. In Section [5] we will 
establish the convergence properties of these martingales, and then apply the CMT to translate 
the convergence of martingales to the convergence of the scaled queue length processes. This 
section will conclude our proof of Theorem [2] In Section [6] we discuss the waiting time in the 
M/M/n- JSQ system. We will conclude in Section [7] with a brief discussion of the implications 
of Theorem [2] and possible extensions. 

We use =£- to denote weak convergence, 1{A} to denote the indicator function for the event 
A, (x) + = max(x, 0). We let R + = R + U {oo} represent the extended positive real line. We will 
equip R + with the order topology, in which neighborhoods of oo are those sets which contain a 
subset of the form {x > a} for some a £ R. Most processes in this paper will live in the space 
D = D([0,oo),R) of right continuous functions with left limits mapping [0, oo) into R. We also 
consider D k = D([0, oo), R fc ) for k > 2, which we will treat as the product space D x D x • • • x D 
(see, e.g., j]j5] §3.3). We will denote the uniform norm 

IMI t = sup \x(s)\ 

0 <s<t 


for x £ D and the max norm 


||(xi,...,x fe )|| t = max HaJillt 

1<1<AC 

for x £ D k . Similarly we will use the max norm 

161 = max Itjl 
Ki<k 


for b £ R fc . 
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2 The model and the main result. 


We consider a M/M/n- JSQ queueing system with n servers where each server maintains a 
unique queue, with service proceeding according to the first-in-first-out discipline. Service time 
is exponentially distributed at each server, with the rate fixed at 1. Arrivals occur in a single 
stream, as a Poisson process with rate A n n, where 0 < X n < 1 and 

lim v^(l — A n ) = /? (2.1) 

n—> oo 

for fixed /3 > 0. Upon arrival, each customer is routed to the server with the shortest queue. In 
the event of a tie, one of the options is selected uniformly at random. 

The state of the system will be represented via the process Q n {t) = (Q"(t), Q 2 (t),.. .), with 
Q"(t) representing the number of queues with at least i customers (including any customer in 
service) at time t > 0. We note that for the system as described we have 

n>Q?{t)>Q2{t)>---> 0, Vf > 0, (2.2) 

and that we can recover the number of queues with exactly i customers in service via the quantity 
QUt)-Qi+ 1 (i), including the number of idle servers n — Q\(t). 

To state our weak convergence results, we also introduce a scaled version X n (t) of this 
process defined as 

X?(t) = 11 and X?(t) = i > 2. (2.3) 

V« \Jn 

The i = 1 case is treated differently because the number of queues with length 1 behaves 
differently than the number of queues of all larger lengths. In particular, there will be O (y/n) 
idle servers, and thus the number of servers with at least one customer in service will be order 
n. 

Our diffusion limit will be the solution to a system of k integral equations for some k > 3, so 
we first introduce this system and prove that it has a unique solution. Furthermore, we prove that 
the system defines a continuous map from K+ x R fc x D k to D k x D 2 with respect to appropriate 

topologies. This continuity, along with the further fact that the function maps continuous 

functions to continuous functions, allows us to use the CMT to prove weak convergence once we 
show the weak convergence of the arguments. 

Theorem 1. Given integer k > 3, B £ M+, b £ and y £ D k , consider the following system: 


Xi (t) 

= h 

+ yi(t) + 

/ (-*i 

(s) -(- x 2 (s))ds - 


(2.4) 

X 2 (t) 

= b 2 

+ 2/2 (t) + 

[\-Xl 

Jo 

(s) + x 3 (s))ds + m(t) - 

u 2 {t), 

(2.5) 

Xi(t) 

= b x 

+ Vi(f) + 

'o 

[s) + Xj+i(s))ds, 3 < * 

<k- 1, 

(2.6) 

Xk{t) 

= b k 

+ Vk{t) + 

r‘-, k 

./n 

(. s)ds , 


(2.7) 

Xi(t) 

<0, 

0 < x 2 (t) < B , 

Xi(t) >0, t > 0, 


(2.8) 


with u\ and U 2 nondecreasing nonnegative functions in D such that 

l{xi(t) < 0}dui(f) = 0, 
l{x 2 (t) < B}du 2 (t) = 0. 
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Then (12.41) - (12.81) has a unique solution ( x,u ) € D k x D 2 so that there is a well defined function 
(/, g) : R+ x R fe x D k —> D k x D 2 mapping (B,b,y) into x = f(B,b,y) and u = g{B,b,y). 
Furthermore, the function (/, g) is continuous on R+ x R fc x D k with respect to the product 
topology when R+ is equipped with the order topology and D is equipped with the topology of 
uniform convergence over bounded intervals. Finally, if y is continuous, then so are x and u. 

We will prove this theorem in Section [3] One implication of Theorem [T| is that the limiting 
system we find in our main result below is well defined because, as we will see, it is an application 
of the function (/, g) with specific arguments b, y, and B = oo augmented with A/(f) = 0 for 
i > k. Note that B = oo implies U 2 = 0. Our main result is the following: 

Theorem 2. In the sequence of M/M/n-JSQ models described above, suppose there exists k 
and a random vector X(0) = (Xi(0), ... ,X k (0)) € such that 

xp(0) => X,(0) in R as n —t oo, 1 < i < k, (2.9) 

and X"(0) = 0 for i > k. Then for any t > 0, 

X™ => Xi in D as n —► oo, i> 1, 


where X\ < 0 and Xi > 0 for i > 2 are unique solutions in D of the stochastic integral equations 

X 1 (t)=X 1 (0) + V2W(t)~pt+ [ (-X 1 {s) + X 2 (s))ds-U 1 (t), (2.10) 

Jo 

x 2 (t) = X 2 (0) + C7i(t) + [ (—x 2 (s) + X 3 (s))ds, (2.11) 

Jo 

X i {t)=X i {0)+ f {-Xi(s) + X i+1 (s))ds, 3 <i < k — 1, (2.12) 


X k (t) 

Xi(t ) 


x k (0) + 


o 



-X k (s)ds, 


0, i > k + 1, 


(2.13) 

(2.14) 


where W is a standard Brownian motion and U\ is the unique nondecreasing nonnegative process 
in D satisfying 

l{Xi(f) < 0}dUi(t) = 0. (2.15) 

Remark 1. The integral equations (12.121) - (12.131) are deterministic and have an explicit solution: 



Xi(t) = e-* 



X k {t) = X k ( 0)e~ t . 


3 < i < k — 1, 


Thus the number of queues of length at least i for i > 3 decays exponentially in time. 

We note that condition (12.91) does place significant but not unreasonable restrictions on the 
starting state of the finite systems Q n . In particular, Q"(0) — n = O (y/n) so the number of 
customers initially in service must be sufficiently near n. Similarly, (I2.9|) requires <3 2 (0) = 
O {-\/n) and therefore Q™(0) = O {y/n) for 3 < i < k. Also note that the longest queue allowed 
in the initial condition has length k. 

Our result shows that the M/M/n-JSQ system in the heavy traffic limit becomes essentially 
a two-dimensional system. If queues with more than one customer waiting are present initially, 
they disappear and do not form again. There are O (y/n) idle servers and O (y/n) queues 
with exactly one customer, and the behavior of processes counting these correspond to a two- 
dimensional Ornstein-Uhlenbeck process. 
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3 Integral representation. 

We will now prove Theorem[l] showing that the representation of the limiting system in Theorem 
[2] is a valid and unique representation. We will also show that it defines a continuous map from 
R + x K. fe x D k to D k x D 2 . The continuity of the map in the topology of uniform convergence over 
bounded intervals will allow us to use the continuous mapping theorem (CMT) to demonstrate 
the convergence X" =>• X-, once we write X” in the appropriate integral form. 

Note that by using R + in the domain of this map we allow the upper barrier B for the 
function x 2 to take the value oo, which corresponds to there being no upper barrier on the i Jn 
scale. 

Our approach to proving Theorem |T| will involve two main lemmas, one dealing with the first 
two dimensions, where reflection plays an important role, and a one dimensional lemma that we 
will apply to the higher dimensions. 

3.1 Lower dimensions. 

To deal with the reflection terms in the first two dimensions of Theorem [Hit 
consider those dimensions completely decoupled from the rest of the system, 
will prove the following: 

Lemma 1. Given B £ R + , b £ R 2 , and y £ D 2 , consider 

xi(t) = bi + yi(t) + / (-xi(s) + x 2 {s))ds - ui(t), 

Jo 

x 2 (t) = b 2 + y 2 (t) + / (-x 2 (s))ds + m{t) - u 2 {t), 

Jo 

x±(t) <0, 0 < x 2 (t) < B, t > 0, 

with u\ and u 2 nondecreasing nonnegative functions in D such that 

l{xi(£) < 0}dui(£) = 0, 

l{x 2 (t) < B}du 2 (t) = 0. 

Then (13. II) - (13.31) has a unique solution ( x,u ) £ D 2 x D 2 so that there is a well defined function 
(f,g) : R+ x R 2 x D 2 —> D 2 x D 2 mapping ( B,b,y ) into x = f(B,b,y) and u = g(B : b,y). 
Furthermore, the function ( f,g ) is continuous. Finally, if y is continuous, then so are x and u. 



is convenient to 
To that end we 

(3.1) 

(3.2) 

(3.3) 


3.1.1 The reflection map. 

In several places we will make use of the well known one-dimensional reflection map for an upper 
barrier. Given upper barrier k £ R+, we let (<^ re ,^ K ) : D —> D 2 be the one-sided reflection map 
with upper barrier at k (see, e.g., [T9] §5.2 and §13.5). In particular for x £ D with x(0) < k 
we have z = ip K (y) > 0, z nondecreasing, 

x = fi^y) = y- z< k, 


and 


coo 

/ l{x < n}dz = 0. 

Jo 

Recall that these functions can be defined explicitly by 


i> K (x)(t) = sup (x(s) - k) + 

0 <s<t 


(3.4) 
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and 


</> K (x)(t) =x(t)~ 0 K (x)(t). (3.5) 

We will also make use of a slight variant of the usual Lipschitz condition for these functions to 

allow for different values of re. In particular, for x, x' £ D, re, re' £ R, and t>0we have 

\\i> K {x) - i]) K >{x') || t < \\x - x'\\ t + |re - re'|, (3.6) 

\\<I>k{x) - <t>K>{x ')||t < 2 \\x - x'\\ t + |re - re'|. (3.7) 

These follow straightforwardly from (13.41) and (13.51) . Note that for re = re' we recover the usual 
Lipschitz constants of 1 for 0 K and 2 for <f> K . 

We also define a trivial reflection map for re = oo by letting (0 oo,ipoo) = (e, 0) where e is the 
identity map. That is, the reflection map leaves the argument unchanged and the regulator is 
identically zero. We prove the following: 

Lemma 2. The function (<(>,'0) : R+ x D —> D 2 defined by (13.41) - (13.51) for finite k and by 
((fooiipoo) = (e, 0) for re = oo is continuous with respect to the product topology when R + is 
equipped with the order topology and D is equipped with the topology of uniform convergence 
over bounded intervals. 

Proof. By (13.61) - (13.71) the function is continuous at any finite re £ R+. For x £ D and x K £ D 
such that x K —> x as re —> oo, 

lim ||0 K (x' t )|| t = lim sup \if> K (x K )\ 

K-i-oo K->-oo 0 < s < t 

= lim sup (a: K (s) — re) + 

K-fOO o< s < t 

= sup lim (x K (s) — re) + 

0<s<t K ~)°° 

= 0, 

where we have made use of the fact that ||x|| t < oo. Therefore (f> K (x K ) if>oo(x) and by m we 
conclude 0 K (x K ) 4>oo(x). Thus the function is continuous at re = oo, completing the proof. □ 

With these facts about the reflection map in hand, we will now prove a result similar to 
Lemma [T] for a related system: 

Lemma 3. Given B £ R +J b £ R 2 and y £ D 2 , consider 

wi(t) = bi +yi(t) + f (-fi 0 (wi(s)) + (f)B(w 2 (s)))ds, (3.8) 

Jo 

w 2 (t) = b 2 +V2(t) + 0o (wi(t)) + [ (—<t>B(w 2 (s)))ds>0. (3.9) 

Jo 

Then (13.81) - (13.91) has a unique solution w £ D 2 so that there is a well defined function t; : 
R + x R 2 x D 2 —> D 2 mapping (B,b,y) into w = £(B,b,y). Furthermore, the function £ is 
continuous. Finally, if y is continuous, then so is w. 

Before proceeding with the proof we introduce a version of Gronwall’s inequality first proved 
by Greene [7] and proved in the form we use by Das [3j: 

Lemma 4 (Gronwall’s inequality). Let K\ and K 2 be nonnegative constants, let hi be real 
constants, and let /, g be continuous nonnegative functions for all t > 0 such that 

f(t) <Kx + hi f f(s)ds + h 2 [ g(s)ds, 

Jo Jo 

g(t ) < K 2 + h 3 f f(s)ds + h 4 f g(s)ds 
Jo Jo 
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for all t > 0. Then 

f(t ) < Me ht and g(t) < Me hi 

for all t it 0 where M = K\ + K 2 and h = max{/ii + / 13 , h 2 + / 14 }. In particular, if K\, /i 2 = 0, 
f/ien f(t),g(t ) = 0 /or all t. 

Proof of Lemma\3i We will show existence via a contraction mapping argument. First we will 
show that for t > 0 there exists a solution w = (wi, W 2 ) to the system of integral equations 


Wl{t) 


bi + Vi{t) + 



(-</> 0 (wi(s)) + (f B ( w 2 (s ) + ^o(wh(s)))) ds, 


u> 2 {t) = b 2 + y 2 {t) + [ (-</>s(w 2 (s) + ^o(^i(s)))) ds > 0 . 
Jo 

Once we have such a solution, it follows immediately that 

w = (wi, w 2 ) = {wi,w 2 + ipo{wi)) 


(3.10) 

(3.11) 


is a solution to 43JD-(& 

We first show that the map defined by the right hand side of (I3.10I) - (I3.11I) is a contraction 
for small enough t. We define T : D 2 —$■ D 2 by 


T(w)i(t) = bi +yi(t) + f (-^ 0(^1 («)) + 0b(w 2 (s) +^o(^i(s)))) ds, 
Jo 

T(w) 2 (t) = b 2 H- 2/2 (t) + [ (-^>b(w 2 (s) + il>o(wi(s)))) ds. 

Jo 


For w, v £ D 2 we have 


and 


||T(w)i-T(u)i|| t < f \\-<j)o(wi) + <l>o(vi)\\ s ds 
Jo 

+ / \\(fB(u>2+'<Po(u>l)) ~ <)>B(v2+'tl’o(Vl))\\ s ds 

Jo 

< 2 / ||u)i — Ui|| s ds 

Jo 

+ 2 ||w> 2 +ipo(u>i) - v 2 - V’o(«i)|| s ds 

Jo 

< 2t ||w>i — ui|| t + 2t ||u >2 — v 2 \\ t + / ||«>i — Ui|| s ds 

Jo 

< 2t\\wi - + 2 1\\w 2 - V 2 \\ t +t ||rhi - Ui|| t 

< 5f ||w> — v\\ t 

||^ 2 - 6 2 || t < / \\~<l ) B{u)2+1po('Wl)) + (t>B(V2+1po(vi))\\ s ds 

Jo 

< 2t\\w 2 + ipo(wi) -V 2 - ifo{vi)\\ t 

< 2t \ \u!2 - V 2 \\ t +t ||wi - Ui|| t 

< 3t 11to — u|L . 


We therefore conclude that 


\\T{w)-T{v)\\ t < 5i||u}-u|| t , 


(3.12) 

(3.13) 






so for to < i, T is a contraction on D([0,to],R 2 )- Therefore by the contraction mapping principle 
(see, e.g., [T5] p.220]), T has a unique fixed point w on D([0, to], R 2 ) such that T(w) = w. This 
fixed point solves (13.10l) - (13.11l) for t £ [0, to]. Now we extend the fixed point argument to 
t £ [fo,2fo], [2to,3<o], • • ■ and repeat to find a solution w to (13.10I) - (I3.11I) for t > 0. As noted 
above, this provides a solution w to (EU-dHU). 

To prove uniqueness of this solution, suppose w and w' are two solutions to (I3.8D - (13.9D . We 
consider 

-</>oK) + 4>o{w[) + 4 >b{w 2 ) - (/>b(w' 2 )\\ s ds 

<2 / (||u>i — wills + IK — ^ 2 11 s ) ds (3.14) 

Jo 

and 

\\4>b{.W 2 ) - <p B (w 2 )\\ s ds 
w' 2 \\ s ds. (3.15) 

To match the form of Gronwall’s inequality (Lemma [4]) we rewrite (13.151) as 

w 2 — w 2 \\ s ds 

and note that the right hand side is nonnegative so the inequality remains true as 

(\\w 2 - w' 2 \\ t - \\w! - w'iWt)* < 2 [ \\w 2 -w 2 \\ s ds. (3.16) 

JO 

We now define 


\W 2 — w' 2 || t — 11 KJl — w[\\ t < 2 I 

Jo 


K - W 2 \\ t < ||^o(wi) - Ip o(K)|lt + 

JO 

< ||wi — Kilt + 2 / j \w 2 
J o 



Ui(t) = ||wi - w [\\ t , 

u 2 (t) = (|K - 1*411 1 - |K - K|| t ) + 


and note 

IK — **4II S < u 2 (s) + Ui(s) s > 0. 
Then (13.141) . (13.161) . and (13.171) imply 

r t r t 


**i(t) < 4 / Mi(s)ds + 2 / U 2 (s)ds, 
Jo Jo 

** 2 (t) < 2 f ui(s)ds + 2 f u 2 (s)ds. 

Jo Jo 


Now by Gronwall’s inequality we have 

u\{t) = 0 and u 2 (t) = 0, 
so by the definition of ui and (13.171) we have 


(3.17) 


IK - Kilt = IK-«4ll t = ° 

for all t > 0 and therefore the solution w is unique. 
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We now establish the continuity of £. Suppose 

(B n ,b n ,y n ) -> (B,b,y) as n oo. 

Fix e > 0 and suppose w n and w satisfy (I3.8D - (13.91) for ( B n ,b n ,y n ) and (B,b,y), respectively. 
Choose N such that for all n > N, 

\b n -b\ + || y n - y|| t + ||</>b»(w 2 ) - </>b(w 2 )|| t < <5 

for some <5 > 0 which is yet to be determined. Note that such an N exists by Lemma [2] and the 
assumption B n —> B. We have 


and 


-&l + lli/ n -i/llt 

+ [ ||-<MK) + </>o(u!i) + - (j)B(w2)\\ s ds 

Jo 

<6+ f (2 IK - Wills + II 0B"K) - ^B"(w 2 )|| s 

+ ||0b»(u> 2 ) - 0b(w 2 )|| s )ds 

<5+ f (2 ||K — wi|| s + 2 ||K — w 2 || s + 6) ds 
Jo 

< 5(1 + t) + 2 / (UK - Wi|| s + IK - w 2 11 s ) ds 
Jo 

|K ~ W 2 ||t - <^(1 + t) + ||K ~ Will t + 2 [ IK - W 2 || s ds. 

Jo 


(3.18) 


(3.19) 


As in the uniqueness argument above, we will apply Gronwah’s inequality, with functions 


ui(t) = ||K — Wi|| t and u 2 (t) = (||K - w 2 || t - ||K — wi|| t ) + . 


Then we have 


u\(t) < d(l + t) + 4 f u\(s)ds + 2 f u 2 (s)ds, 

Jo Jo 

u 2 (t) < 6(1 +1) + 2 f ui(s)ds + 2 [u 2 (s)ds, 

Jo Jo 


so Gronwall’s inequality implies 

u\(t) <26(1+t)e et and u 2 (t) < 26(1 + t)e 6t 

and we have 

IK -wi|| t < 26(l + t)e 6t and ||K — w 2 || t < 4<5(1 + t)e 6t . 

We choose 6 = jp^ee -64 to establish the desired continuity. 

For the proof of continuity of w we note 

rt+s 


/ t+s 

| 4>b(w 2 (z)) - 4>o(wi(z))\dz 


and 


\4> B (w 2 (z))\dz 

The boundedness of w± and w 2 proved in Lemma [9] imply that w\ and w 2 are continuous if y i 
and y 2 are continuous. □ 


10 


We are now prepared to prove Lemma [TJ 

Proof of Lemma QJ Our key insight is to see that a solution is found by setting aq = </>o(uq), 
ui = if o(uq), X 2 = J>b{w 2 ), and U 2 = i/jb(w 2 ) where (uq,^) is the unique solution defined by 
Lemma [3] 

To see that it is unique, note that the conditions on u\ and U 2 imply that they can be written 
as ifo( z i) and V'-b( z 2 ) for some functions z±,Z 2 £ D. Then x± and X 2 are 4>o{zi) and J>b{z 2 ) for 
the same z\ and Z 2 ■ Then (j2.4ll - fl2.8l) imply that z = (^ 1 ,^ 2 ) must be a solution of (13.8M3.9ll . 
By Lemma [5] this solution is unique. In particular, this solution is 

xi = fiip.y) = (</>o °£i )(b,y), 
ui = 9i{b, y ) = (Po ° £i)(&, y), 

X 2 = f 2 (b, y) = (Pb o &){b, y), 

U 2 = 92{b, y) = (ifs 0 &)(b, y). 

The reflection maps (</>o ) '0o) and ((Pb-,'>Pb) are continuous in the uniform topology and also 
preserve continuity. Since £ also has these properties by Lemma [3j we conclude that (/, g) are 
continuous and preserve continuity. □ 


3.2 Higher dimensions. 

Because of the lack of reflection terms for i > 3, the higher dimensional terms in Theorem |T] 
behave in primarily one-dimensional ways. Therefore we will note an existence, uniqueness, and 
continuity result for a one-dimensional system that we will use in our proof of Theorem [Q This 
lemma is a special case of a slightly more general result proved by Pang, Talreja, and Whitt m 
Theorem 4.1], namely letting h(x) = —x. 

Lemma 5. Given b £ R, and y £ D, consider 


x(t) = b + y(t) + f —x(s)ds 

Jo 

x(t) >0, t > 0 


(3.20) 

(3.21) 


Then ( 13 . 201 ) - ( 13 . 211 ) has a unique solution x £ D so that there is a well defined function 
mapping (b,y) into x = f(b,y). Furthermore, the function ( f,g ) is continuous. 
Finally, if y is continuous, then so is x. 

With this lemma in hand, we can prove Theorem [l] 

Proof of Theorem{J\ We first prove existence, uniqueness, and preservation of continuity by 
induction on k > i > 3. 

As the base case, observe that Lemma [5] implies there exists a unique solution Xk (t) which 
preserves continuity of yk(t). 

As an induction hypothesis, suppose for some i > 3 there exist unique solutions Xi+i(t ),..., Xk{t) 
which are continuous if Vi+i , ■ ■ ■ ,yk are continuous, and consider Xi(t). 

We have the integral equation 

Xi{t) = bi + yi(t) + / {-Xi(s) + x i+ i(s))ds 
Jo 

Xi+i (s)ds + / —Xi(s)ds. 

Jo 


— bi + yi(t) + 
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By the induction hypothesis Xi+i(t) exists and is unique, so we let 

y{i) = yi(t)+ / x i+1 {s)ds 
J o 

and apply Lemma [5] with y = y to conclude that there exists a unique solution Xi(t). This 
solution is continuous if yi,..., yk are continuous because in that case that y(t) is continuous. 

By induction, we conclude that there exist unique solutions X 3 ,,Xk which preserve conti¬ 
nuity of 2 / 3 ,..., yk- From this, we can define 


3/2M = 2 / 2 (i) + / x 3 (s)ds 

J 0 

and apply Lemma[l]with yi = yi to complete the proof of existence, uniqueness, and preservation 
of continuity. 

To verify that the map (/, g) is continuous, suppose x n (t) and x(t) solve (I2.4H - II2.8H for 
(B n ,b n ,y n ) and ( B,b,y ), respectively, and further suppose ( B n ,b n ,y n ) — > (B,b,y). 

We again proceed by induction on k > i > 3. 

For the base case note Lemma [5] implies x 7 / —> Xk- 

As an induction hypothesis, suppose for some * > 3 we have x" —> Xj for i < j < k. 

By the induction hypothesis we have 

V?(t)+ [ x ?+i( s )ds —t Vi(t) + [ x i+1 (s)ds, 

Jo Jo 

and thus, again by Lemma [5J a;" —> Xj . 

By induction we conclude x™ —> Xi for 3 < i < n. This further implies 

2 / 2 W+ [ x 2(s)ds y 2 (t) + [ x 3 (s)ds, 

Jo Jo 

so Lemma Q] implies (xi,^) “^ ( x u x 2 ), and continuity is established. □ 

4 Truncation and martingale representation. 

To use Theorem [T] in a CMT argument, we want to write the process X n in the appropriate 
integral form. Instead of directly considering the full M/M/n- JSQ system, however, we will 
introduce a truncated variant which we will later show has the same behavior as X n in the 
limit. It is this truncated system that will be shown to take the integral form of Theorem [lj 

4.1 Truncation. 

An important feature of the behavior of the M/M/n- JSQ system is that queues with more than 
one customer waiting are formed only if all queues have at least one customer waiting already. 
One of our goals is to show that the number of queues with a customer waiting, is of the 
order O (y/n) and thus it is unlikely for longer queues to form. With this in mind, as a proof 
technique, we now introduce a truncated version of the system in which no queues with length 
greater than 2 are created, though they are allowed to exist in the initial condition. This system 
will be significantly easier to analyze and will be shown to have stochastic behavior exactly 
matching the untruncated system with high probability. See Figure Q] to see a sample path of 
an original untruncated system which starts with order 0 (1 Jn) queues of length 4. Note that 
the number of queues length 3 and length 4 decrease monotonically. 
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(a) Idle servers (b) Longer queues 

Figure 1: A simulated sample path of an M/M/n -JSQ system, showing the scaled number of idle 
servers (a) and queues of length at least two (Xg), three (X£), four (X2), and five (X£). Simulated 
with n = 10 5 , fd = 2.0. 


Now we make this more precise. Consider a system in which any arrival that would create 
a queue of length 3 or longer is rejected. That is, if an arrival occurs when all queues contain 
at least two customers, that arriving customer does not enter the system. We will denote this 
system Q n = (Q™, Q%, ■ ■ •)■ We also introduce scaled versions 

X? (t) = ~ n and Xp(t) = i > 2. (4.1) 

\Jn yjn 

It will also be convenient for us to adopt the condition from Theorem [2] that Q£ + 1 (0) = 0 
for some k > 2. Since queues of length 3 or longer are never created in this system, this implies 
Q?(t) = 0 for i > k + 1, and thus analysis of Q n (t) can be limited to the first k dimensions. 

We will now construct the truncated process X n (t) and show that it has the integral form 
in Theorem [T| Our representation will be similar to the first martingale representation of [T4] ; 
in particular it will rely upon random time changes of rate-1 Poisson processes. 

4.2 Random time change. 

We let A, Di for 1 < i < k be rate-1 Poisson processes and write 

Q?(f) = Qi (0) + A (A n nt) - Di ((g?(a) - <&(*)) <tej - (4.2) 

QW) = Q2( 0) + U?(t) - D 2 [J* (Q2(s) - Q 3 (s)) dsj - U 2 n (t), (4.3) 

QUt) = Q?( 0) - a (J* (q?(s) - Q? +1 (s)) ds^j , (4.4) 

QUt) = QUO) - A (j* Quads') , (4.5) 

where [/"(t) is the number of arrivals in [ 0 ,t] when every server has at least one customer, and 
U£(t) is the number of arrivals in [0,i] when every server has at least one customer and all n 
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servers have two customers. Formally, we define 


W) 

u?(t) 


1 |q"( s) = n j dA(\ n ns), 

1 |Qi (s) = ra, Q 2 ( s ) = raj dA(X n ns). 


(4.6) 

(4.7) 


We can understand (14.21) term-by-term: first we record the initial state of the system with Q”(0), 
then arrivals are counted at their full rate A n n. The D\ term represents departures, which occur 
as a Poisson process with rate equal to the number of customers in service. Since Q\ includes 
queues of length 1 and length 2, however, Q\ will only decrease when a customer departs a 
queue and leaves the server empty. Therefore the instantaneous rate at time s in the D\ term 
is Qi(s) — Q 2 (s), the number of queues of length exactly 1 at time s. Through the first three 
terms of (14.21) we have recorded what the value of Q\ would be if it were not constrained to 
be at most n , so the final term will represent this barrier. The process U ™ records any arrival 
which would increase Q\ above n, balancing the overcounting we get from A(A n nt). 

We can understand (|4.3I) in much the same way, with the key difference being in the arrival 
process. Since arriving customers will always join the shortest available queue, the number of 
length 2 queues will increase only when all servers are busy. Such arrivals are exactly recorded 
by Ui, so this will be the process we use to record potential increases to Q%. The process U% 
provides the upper barrier n on Q 2 . 

The remaining equations (|4.4D - (I4.5D are the same except that we record no arrivals, as our 
truncated approximation does not create queues of length 3 or longer. 

As in m Lemma 2.1], we can verify that this construction is well defined and generates an 
element of D k by conditioning on the starting state Q n ( 0) and processes A, Di then constructing 
recursively. 


4.3 Martingales. 

Because our approach to (|4.2D - (14.5D will be to apply the functional central limit theorem (FCLT) 
for Poisson processes, we will now rewrite the time changes of Poisson processes as time changes 
of scaled Poisson processes. To that end, we define scaled martingales 


Mo(t) = A=A ( X n nt ) - XnVnt, (4.8) 

Vra 

M?(t) = QT (Q?(a) - Q?+i(s)) ds^j (Q?(s) - QS+iW) ds, 1 <i < k, 

(4.9) 

= 1=D k [J Q^(s)ds' S j ~^J Q n k {s)ds. (4.10) 


Via an argument exactly analogous to that of in §7.1 of |T4] leading to Theorem 7.2 we obtain 
that M™ for 0 < i < k are square-integrable martingales with respect to an appropriate filtration. 
We note for later use that this argument also supplies the predictable quadratic variations 


(Mo) (t) = Ant, (4.11) 

(m?) it) = - [\q7(s) - Q7 +1 (s))ds, (4.12) 

(mt) (t) = i jf‘ Q n k (s)ds. (4.13) 
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We also define 


Then we have 


and 


= and K(t)= Um 


yfn 


\fn 


xm = 


Qi(t) - n 
\fn 

Q?(0)-n 1 

= - + A (X n nt) 

v« v n 

- -J=D X ( Aq?(s) - g2(a))da) - ^2 
V™ \J o / V™ 

= X"(0) + M 0 n (t) + A nV ^i - W) 


- M?(t) - y (Q?(s) - Q£(s)J ds 
= xm + M™(t) - M?(t) + XnVnt - V?(t) 

- v^t - f( ^" ( A~ n - ds 

Jo \ V n V n J 

= A”(0) + M 0 n (i) - M?Ct) - (1 - X n )y/nt 


- / (A?(s) - X 2 »( a ))d« - 
Jo 


(4.14) 


* 2 " (*) = ^ 2 n ( 0 ) + VT(t) - M 2 "(i) - £ (xs(a) - x 3 "(s)) ds - V?(i), (4.15) 

X?(t)=X?(0)-M?(t)- J* (xr(s)-xr +1 (s))ds, 3 < i < fc — 1, (4.16) 

X£(i) = Xjf (0) - M£(t) - f XZ(s)ds. (4.17) 

do 

At this point we can also note that (14.14I) - (14.17D put X n (t) in the integral form of Theorem 
m The only difference is the processes V n , which are not described in exactly the same way. 
We see, however, that by (14.61) we have 

/>CO 

0 = / l{Qi (s) < n}dUi(s) 

Jo 

l{A”(s) < 0}dtd"(s) 

1{X?{8) < o}dt> 1 n (s). (4.18) 

Similarly by (14.7|) we have 

p CO 

0 = / 1{<5i(s) < n or Q%(s) < n}dU£(t). 

Jo 
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which implies 


0 = 


HQS(s) < n}d,U2(t) 


i{X 2 "(s) < v^} dV 2 n (t). 


(4.19) 


Thus by (14T4P - (14T7P and (l4T8l) - (IXTT)l) X” is the unique solution of (f2Tjl - (fT8|t for b = X n (0), 
y l = M(f{t) — Mf(f) — (1 — A n )y/nt, i/i = —Mf{t), 2 < i < k, and B = sfn. Thus to apply the 
CMT it remains to prove the convergence of the martingales (ld.8l) - (14.10l) . 


5 Martingale convergence. 

We will now prove the convergence of Mf to Brownian motions. 

Lemma 6. For the sequences of scaled martingales Mf defined in Section \4-S\ we have the 
convergence 


(*%, Mi, M?,..., MS) =* (W lt W 2 , 0,..., 0) 


in D as n —^ oo, 


(5.1) 


where W\ and W 2 are independent standard Brownian motions. 

To prove this lemma we will rely upon the CMT and the FCLT for Poisson processes (US 
Theorem 4.2]), which we restate here convenience. 

Lemma 7. (FCLT for independent Poisson processes) If A and Di are independent rate-1 
Poisson processes and 

„ , . . C(nt) — nt 

M C ,nit) = 


y/n 


for C = A, Di, then 


(M A< n, M Dl , n ,..., M Dk , n ) => (Wi, W 2 , ■ ■ ■, Wfe+i) in D 


fc +1 


as n —> 00 


where Wi are independent standard Brownian motions. 

To apply Lemma [7] we will define random and deterministic time changes such that the 
martingales Mf can be written as a composition of a time change and the scaled Poisson 
processes Mc, n - Specifically, let 


*&A,n(t) — A n t, 


$Di 

®D k 


,n(t) = -[ QS{s)ds — — f Qi+i(s)ds, 
n Jo n Jo 

,n{t) = ~ f Qk(s)ds, 

n Jo 


(5.2) 

(5.3) 

(5.4) 


so that we have 


MS = M A , n °® A , n , M? = M Di , n o <& Di , n , 1 <i<k. 

To apply the CMT with the composition map o, we need to determine the limits of the time 
changes (I5.2D - (15.4D . 

First we note that (12.11) implies \ n —> 1, which in turn implies 

&A,n => e as n —> oo, (5-5) 
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where e is the identity function in D. 

Next we note that the terms of m interleave over i, so for 1 < i < k — 1 we have 

$Di,n => fi~ fi +1 as n -)> oo, 
where fi is the limit of $ Oi,n with 

1 [ d - 

$D i<n (t) = - Qi{s)ds. 

n Jo 

To find fi we will first show fluid limits for Qf. 

Lemma 8. Let 'F” for 1 < i < k be defined by 


= *>o. 


Then for 2 < i < k, 


U/" => w and \F" =£* 0 as n —»■ oo (5.6) 

where ui(t) = 1 /or t > 0 . 

The proof of this lemma is found in Section 15.11 To use Lemma [8] we define a continuous 
function h : D —> D by 

h(x)(t) = / x(s)ds 

Jo 

for t > 0. The $ 1)1 ,n = ho so by the CMT and Lemma [8] we know /1 = h o ui. Namely, 

flit) = [ Ids = t 

Jo 

so f 1 = e is the identity function in D. Therefore we have 

=> e as n —> 00 . 

For 2 < i < k we have /j(i) = Ods = 0 so / = 0 on D. We conclude that 

=> e as n —> 00 , 

and for 2 < * < k 


&Di,n => 0 as n —> 00 . 

Therefore once we establish Lemma [8] we can prove Lemma |6l 


(5.7) 

(5.8) 


Proof of Lemma \Q We apply the CMT with Lemma [T] and the limits (15.511 , (15.71) , and (15.81) to 
obtain 

{Mq , M", M2 = (Ma,„ o <f >A,n, Md 1 ,u ° M D2 ,n ° $D 2 ,n, • • ■ , M Dk ,n ° < &.D fc ,n) 

=> (Wi o e, W 2 o e, W 3 o 0 ,..., Wfc+i o 0 ) 

= (W 1 ,W 2 ,0,...,0). 

□ 
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5.1 Fluid limit. 


We will prove Lemma [8] by showing that X n is stochastically bounded in D. Namely, we will 
prove that the sequence of real-valued random variables X™ is tight for every t > 0 and 

i > 1. For a more complete discussion of stochastic boundedness as we use it here see §5 of El- 
The stochastic boundedness of Xf and X 2 will follow from the stochastic boundedness of 
Mq ', Mi, M 2 i and X^. To see this, we prove 


Lemma 9. Given (B n , X”(0), Yf 1 ) a random element o/R+ xl xD for each n > 1 and i = 1,2, 
recall that Lemma{J} implies that the system 


xrw = 
X\ut) = 
0 = 
0 = 


X"(0) + Y?{t) + f\-X ?(s) + x?(s))ds 

JO 

X 2 "( 0 ) + Y 2 n (t) + f (-X 2 (s))ds + V?(t) 
Jo 

i{x?(t) < o }dvr(t), 

l{X 2 "(t) < B n }dV 2 n (t), 



Kit), 
Kit) > o, 


has a unique solution {X n ,V n ). If the sequences (X"(0),n > 1) and iY™,n > 1) are stochasti¬ 
cally bounded for i = 1,2, t/ien the sequence (X", n > 1) is stochastically bounded in D. 

Note that we do not require boundedness for B n . 


Proof. We fix t > 0. We will establish the bound 


X r 


< 8e bt (|X"(0)| + ||F ra || t ). 


(5.9) 


from which the result follows. 

To show m , we will prove a similar bound for the unreflected process W n defined by 
Lemma [3l Then (15.911 will follow from the Lipschitz continuity of the reflection maps <f >o and 

4>b „ • 

Just as in Lemma[]]and Lemma [3] we write X”(t) = 0 o (bF"(t)) and X 2 {t) = ^B^iW^it)) 
where W”(t) and W 2 (t) satisfy 

w?(i) = x"(o) +17* (t) + f\-MWTis)) + MW2is)))ds, ( 5 . 10 ) 

Jo 

W?(t) = X 2 "(0) + y 2 "(i) + [ (-0 B -(W?( a )))da + ifo (W?(i)). (5.11) 

Jo 

We now use Gronwall’s inequality as stated in Lemma 31 Using the Lipschitz property for 
4’o,4 l B n , and V’o we have for t > 0 

\\w?\\ t < |XJ*(0)| + \\Y?\\ t + 2 f (\\w?\\ s + IIWTIUds, 

JO 

ira* < 1^2 (o)i + iii?n t + wMwniit + f \m\\ s ds. 

Jo 

Now we note that we have 

iiMWT)ii t <ira*- 
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We define 


and 




Mt) = (\m\\ t -\\wr\\ t ) + . 


Finally we note 

so we can write the inequalities 


\\W 2 \\ t < u 2 {t)+u 1 (t), 


ui(s)ds + 2 / u 2 (s)ds, 

Jo 

u 2 {t) < |X2(0)| + ||y 2 n || t + [ ui(s)ds+ f u 2 (s)ds. 

Jo Jo 

Let |AT rl (0)| + ||Y rl || t = K. Then Lemma H] implies 

u\(t) < 2 Ke 6t and u 2 {t) < 2 Ke 6t . 

From (15.1211 and the definitions of u\ and u 2 we obtain 

\\W?\\ t < 2Ke 6t and 11W 2 ”|| t < 4Ke 6t . 

Since </>o and 4>b « are Lipschitz continuous with constant 2 this implies 


«i(i)<l^r(0)| + ||iT 


(5.12) 


which proves (15.91) . 



< 4 Ke 6t 


and 



< 8Ke 6t , 


□ 


Note that this proof also provides the boundedness of w that we use in the proof of continuity 
of w in Lemma [31 and that it does not use any of the continuity properties proved using that 
boundedness. 

In our application of Lemma[9]we will have Y-{ 1 = M"’ 1 — M”’ 2 — (1 — A n )y/nt and Y 2 {t) = 
M”’ 2 (t) + f*X^(s)ds, so it remains to prove that f* Xg(s)ds and each martingale Mj?’ 1 is 
stochastically bounded. 

To show that f 0 X^(s)ds is stochastically bounded we need only show that X£ is stochasti¬ 
cally bounded. This follows from the fact that Q^iO) is stochastically bounded by assumption 
(12.91) and 




Qs(t) < QS( 0 ) 

y/n - yfn 


because no queues of length 3 or longer are ever created. An identical argument proves stochastic 
boundedness of XJ 1 for 4 < i < k. 

To prove the stochastic boundedness of these martingales we will use the following lemma 
from [23 : 


Lemma 10 ( |14| Lemma 5.8). Suppose that, for each n > 1, M n is a square integrable mar¬ 
tingale with predictable quadratic variation ( M n ). If the sequence of random variables ( M n )(T ) 
is stochastically bounded in R for each T > 0, then the sequence of stochastic processes M n is 
stochastically bounded in D. 

We now prove that the predictable quadratic variations of M" are stochastically bounded. 
In the case of Mq this is immediate since by (14.111) the quadratic variation is deterministic. 
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For M” we refer to (14.121) and apply crude bounds to see 


= - A Q^s) ~ QZ(s))ds 

1 n J o 


< - / Qi(s)ds 

™ Jo 

< -(Q?(0)+T(A„nf)). 
n 


It suffices to show stochastic boundedness of each term in the sum. For Q" 
assumption m - 

For A(X n nt) we note A ra —>• 1 so by the strong law of large numbers 
processes we have 


A(X n nt) 

n 


->• e(t) 


(0) this follows from 
(SLLN) for Poisson 


with probability 1, which implies stochastic boundedness, so we conclude that Mf is stochasti¬ 
cally bounded. 

For MJf we have 


(%} (t) < (o)+ 

< — (QUO) + Al(A„nt)), 

n 

and stochastic boundedness follows. 

We now return to the proof of Lemma [5J 

Proof of Lemma 0 We have for * < 2 < k that 

X? = an d X? = A 

Vn yjn 

are stochastically bounded. Therefore 

x n 

—f= => 0 in D as n — V oo. 

\Jn 

From the definition of X n this is equivalent to 

Q n O n 

Ti' = —- => w and T" = —— => 0 in D as n —> oo 
n n 


for 2 < i < k. □ 

5.2 Proof of Theorem |2J. 

Now that we have the convergence of the martingale processes M", we can apply the CMT to 
prove Theorem [2] 

Proof of Theorem [H We first show X n => X. 

In Theorem[Q in the pre-limit regime we set B n = i/n, bi = X"(0) for 1 < i < k, 

2/i (t) = Mq (i) - M"(t) - (1 - X n )y/nt, 
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and yi(t) = —M”(t) for 2 < i < k. Equations (14.181) and (14.191) show that V{ 1 and VJ 1 are 
appropriately acting as u i and U2 in the integral representation, so Xi{t) = X"(i) for 1 < i < k. 
For application of the CMT we need only determine the limits of B , b and y. We have B n —> oo, 
so in the limit U 2 = 0. 

By assumption we have 

xm^x k (o), 

so in the limiting system we let bi = W(0). Next we have by (12.11) and (15.11) 

M 0 n (f) - M?(t) - (1 - X n )y/nt => W^t) - W 2 {t) - fit 

= V2W{t)-pt , 


where W is a standard Brownian motion and = indicates equivalence in distribution. Another 
application of (15.11) implies 

-Mf => 0 

for 2 < i < k so in the limiting system we have y\{t) = y/2W(t) — (3t and yi(t ) = 0, for 2 < i < k. 

The CMT then implies that for 1 < * < fc, Xf => Xi in D as n —> oo where Xi is described 
by (12. 10D - (12. 15D . We can augment the truncated system with X™(t) = 0 for * > k and note that 
0 = Xp => Xi = 0 for such i. Therefore X n =£- X. 

Now we consider the untruncated system described by X n . By an argument like that Section 
[4j we have 

Qi(t) = Q?( 0) + A (A n nt) - D 1 ^ (Q?(a) - Q2(a)) da) - (5.13) 

QW) = Qm + U?_ 1 {t)-D i (^j\Q?{s)-Q^ 1 {s))ds^-U?{t), * > 2, (5.14) 

where (7™(t) is the number of arrivals in [0, t\ when every server has at least i customers. Note 
that we introduce extra rate one Poisson processes A for i > k. 

Now define 

t* = inf{f > 0 : Q 2 (t) = n} (5.15) 

and note that for t € [0, t*) we have t/"(t) = 0 for i > 2. This, along with Q"(0) = 0 for i > k, 
implies that for such t, the system (I5.13l) - (|5.14l) becomes 

Qi(t) = Q?(0) + A (A n nt) - A Qf (Q?(a) - Q2(a)) da) - 
Q 2 (t) = Q£(0) + U?(t) - D 2 Qf (Q£(a) - Q£(a)) da) - [/?(*), 

Qi(t) = Q?(0) - A (jf* (Q?(a) - Q? +1 (a)) da) , 

= Qfc(0) - A (jf* Qfe(s)ds) , 

which precisely matches (14.21) - (14.51) . Thus for t G [0,t*), X n [t) and X n (t) are identical. 

It only remains to show for alH > 0 that P(i* < t) —> 0 as n —> 00 . Because the systems are 
identical up to time t*. we can replace Q^it) in (15.151) with Q%(t) to see 


P(t* < t) = P ( sup Q%(s) > n ) = P ( sup X^s) > y/n 




0 <s<t 

sup X”(a) > C 

0 <s<t 


0 <s<t 
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for constant 0 < C < y/n. By the weak convergence X 2 => X 2 and the fact that jsup 0<s<t X 2 (s) > C 
is closed, we have 


lim sup P 

n 


sup X 2 (s) > 

.0 <s<t 



< 


sup X 2 (s) > C 

0 <s<t 


By continuity of probability we have 


lim P 

C—> oo 


sup X 2 (s) > C 

\o <s<t 


( sup X 2 (s) = oo I = 0. 
Vo <s<t ) 


We therefore have 


limsupP(t* <t)< lim lim sup P ( sup X 2 {s) > C I < lim 


C —^ oo 


\0<s<t 


C — y oo 


sup X 2 {s) > C =0 

o <s<t 


and thus P(i* < t) —> 0 as n —> oo. We conclude X n => X. 


□ 


6 Waiting time. 

An important performance measure of a queueing system is the expected time that customers 
will have to wait before entering service. In the M/M/n system with a single queue in the 
Halfin-Whitt regime, the expected waiting time is of the order 0(l/y/n). We will now show 
that the M/M/n -JSQ system has the same order of aggregate waiting time in the transient 
regime, and thus seems to have a minimal loss of efficiency as measured by waiting time. 

Notice that our representation of the system allows us to directly consider the total time 
any customers in the system will wait. In particular, the instantaneous number of customers 
waiting to be served at a given time t is precisely '/2 i>2 Qi W • This quantity can be integrated 
over time to compute the aggregate waiting time in the system. With this insight, we prove the 
following: 

Theorem 3. The aggregate waiting time of customers who arrived over the time period [0, t], 
which we denote Zf, satisfies 

lim lim sup P (Zf > Cy/n) = 0. (6.1) 

C —^ OO 

Since the total number of arrivals to the system in this time is 0 (n) with high probability, the 
waiting time per arrival is O (1 /\fn) with high probability. 

Proof. As noted above, the aggregate waiting time over the period [0, t\ is 



We consider a scaled version Y t n = Zf /yfn. Note that Y t n < t sup 0<s<f J2i>i an d thus 


P(Y t n > C) < P f t sup ^XT(s) > C I 

\ °— s —* i>2 J 

for any constant C > 0. By the weak convergence Xf => JQ and the fact that {t sup 0<s<t J2i> 2 ^ n ( s ) — 
C} is closed, we have 


lim sup P I t sup y, 

o<s<i i>2 


XP(s) >C\< 


sup 
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By continuity of probability and Xi = 0 for i > k we have 


lim P (t sup Xi(s) > C ] = 0. 

C^oo ^ 0 <e<t^ 2 J 

Therefore we conclude 



which proves (16.111 . We note that this implies Y t n = 0(1) with high probability, and thus 
Z" = O (y/n) with high probability. 

Because customers arrive according to a Poisson process with rate A n n = 0 (n), this implies 
the waiting time per arrival is 0 (y/n/n) = 0(1/y/n) with high probability, completing the 
proof. □ 

Theorem [3] does not directly tell us anything about the distribution of the waiting time. 
We can see from considering the system that this waiting time is distributed in a qualitatively 
different way than the standard M/M/n system. Customers immediately enter service if there 
are any idle servers and otherwise wait a constant order amount of time for the previous customer 
in their queue to finish service. Because the aggregate waiting time is of the order O (y/n) and 
and any arriving customers who wait at all incur a constant order waiting time, the total number 
of customers who have to wait is also 0(y/n). As noted above, the total number of arriving 
customers is order n, so the fraction of customers who have to wait is of the order O (1 /y/n). 

7 Open questions. 

Theorem [2] proves that the behavior of the M/M/n -JSQ system in the Halfin-Whitt regime is 
best understood on the order of O (y/n). In particular, the numbers of idle servers and waiting 
customers will both be O (y/n). If long queues are initially present, they will empty in fixed 
time, and no additional long queues will be created. 

Significant questions remain about the steady state behavior of our system. In particular, 
we do not characterize the distribution of the steady state of the limiting system or show that 
the steady state of the ra-th system converges to the steady state of the limiting diffusion process 
(interchange of limits). 

Finally it is always of interest to analyze our system for general interarrival and, especially, 
general service times distribution. We conjecture that the qualitative behavior established in 
this paper in the transient domain and the conjectures above regarding the steady-state behavior 
and the interchange of steady-state limits remain true in this case as well. 
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